AITopics | video domain generalization

video domain generalization

Supplementary Material for "Diversifying Spatial-Temporal Perception for Video Domain Generalization" Kun-Yu Lin

Neural Information Processing Systems, Feb-16-2026, 13:38:54 GMT

Hard Norm Alignment loss (HNA): apply the HNA loss (Eq. … HMDB, which demonstrates the effectiveness of our model. First, we drop the feature from a specific spatial group.

Method       UCF HMDB
STDN-T -1    59.2
STDN-T -2    58.1
STDN-T -3    59.4
STDN-T -4    58.9
Full STDN    60.2

Second, we drop the feature from a space scale. In our main manuscript, we conduct all experiments based on ResNet-50.

artificial intelligence, machine learning, spatial reasoning, (12 more...)

Country: Asia > China (0.05)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Vision (0.75)
Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (0.53)

Diversifying Spatial-Temporal Perception for Video Domain Generalization

Neural Information Processing Systems, Dec-26-2025, 13:46:47 GMT

Video domain generalization aims to learn generalizable video classification models for unseen target domains by training in a source domain. A critical challenge of video domain generalization is to defend against the heavy reliance on domain-specific cues extracted from the source domain when recognizing target videos. To this end, we propose to perceive diverse spatial-temporal cues in videos, aiming to discover potential domain-invariant cues in addition to domain-specific cues. We contribute a novel model named Spatial-Temporal Diversification Network (STDN), which improves the diversity from both space and time dimensions of video data. First, our STDN proposes to discover various types of spatial cues within individual frames by spatial grouping. Then, our STDN proposes to explicitly model spatial-temporal dependencies between video contents at multiple space-time scales by spatial-temporal relation modeling. Extensive experiments on three benchmarks of different types demonstrate the effectiveness and versatility of our approach.
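The two ideas named in the abstract can be illustrated with a rough sketch. This is not the STDN implementation: the abstract does not give the formulation, so the code assumes that spatial grouping means softly assigning each spatial location's feature to one of K anchor vectors, and it uses sliding-window averaging over frames as a simple stand-in for modeling at multiple time scales. The names `spatial_grouping`, `multi_scale_temporal_pool`, and `anchors` are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def spatial_grouping(features, anchors):
    """Pool N location features into K group features (assumed formulation).

    features: list of N vectors (one per spatial location of a frame).
    anchors:  list of K vectors (hypothetical learnable group prototypes).
    Returns K vectors, each a soft-assignment-weighted mean of the features.
    """
    dim = len(features[0])
    # Soft-assign every location to the K groups by dot-product similarity.
    assign = [
        softmax([sum(f * a for f, a in zip(feat, anchor)) for anchor in anchors])
        for feat in features
    ]
    groups = []
    for k in range(len(anchors)):
        weight = sum(row[k] for row in assign)  # always > 0 after softmax
        pooled = [
            sum(row[k] * feat[d] for row, feat in zip(assign, features)) / weight
            for d in range(dim)
        ]
        groups.append(pooled)
    return groups


def multi_scale_temporal_pool(frame_feats, scales):
    """Average frame features over sliding windows of several lengths.

    A stand-in for multi-scale temporal modeling, not the paper's
    relation module. Returns {scale: list of window-averaged vectors}.
    """
    out = {}
    for s in scales:
        windows = []
        for start in range(len(frame_feats) - s + 1):
            chunk = frame_feats[start:start + s]
            windows.append([sum(col) / s for col in zip(*chunk)])
        out[s] = windows
    return out
```

With two well-separated anchors, each group feature is dominated by the locations most similar to its anchor, which is the sense in which the groups capture different types of spatial cues. The pooled outputs at different window lengths give one feature sequence per time scale.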

diversifying spatial-temporal perception, name change, video domain generalization, (4 more...)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)

Diversifying Spatial-Temporal Perception for Video Domain Generalization

Neural Information Processing Systems, Jan-19-2025, 19:30:33 GMT

diversifying spatial-temporal perception, domain-specific cue, video domain generalization, (2 more...)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (1.00)
